{
  "cells": [
    {
      "attachments": {},
      "cell_type": "markdown",
      "id": "26e50a28",
      "metadata": {},
      "source": [
        "# Introductory tutorial\n",
        "\n",
        "This is the introduction to a seven part tutorial which demonstrates how to de-duplicate a small dataset using simple settings.\n",
        "\n",
        "The aim of the tutorial is to demonstrate core Splink functionality succinctly, rather that comprehensively document all configuration options.\n",
        "\n",
        "The seven parts are:\n",
        "\n",
        "- [1. Data prep pre-requisites](./01_Prerequisites.ipynb)\n",
        "\n",
        "- [2. Exploratory analysis](./02_Exploratory_analysis.ipynb) <a target=\"_blank\" href=\"https://colab.research.google.com/github/moj-analytical-services/splink/blob/master/docs/demos/tutorials/02_Exploratory_analysis.ipynb\">\n",
        "  <img src=\"https://colab.research.google.com/assets/colab-badge.svg\" alt=\"Open In Colab\"/>\n",
        "</a>\n",
        "\n",
        "- [3. Choosing blocking rules to optimise runtimes](./03_Blocking.ipynb) <a target=\"_blank\" href=\"https://colab.research.google.com/github/moj-analytical-services/splink/blob/master/docs/demos/tutorials/03_Blocking.ipynb\">\n",
        "  <img src=\"https://colab.research.google.com/assets/colab-badge.svg\" alt=\"Open In Colab\"/>\n",
        "</a>\n",
        "\n",
        "- [4. Estimating model parameters](./04_Estimating_model_parameters.ipynb) <a target=\"_blank\" href=\"https://colab.research.google.com/github/moj-analytical-services/splink/blob/master/docs/demos/tutorials/04_Estimating_model_parameters.ipynb\">\n",
        "  <img src=\"https://colab.research.google.com/assets/colab-badge.svg\" alt=\"Open In Colab\"/>\n",
        "</a>\n",
        "\n",
        "- [5. Predicting results](./05_Predicting_results.ipynb) <a target=\"_blank\" href=\"https://colab.research.google.com/github/moj-analytical-services/splink/blob/master/docs/demos/tutorials/05_Predicting_results.ipynb\">\n",
        "  <img src=\"https://colab.research.google.com/assets/colab-badge.svg\" alt=\"Open In Colab\"/>\n",
        "</a>\n",
        "\n",
        "- [6. Visualising predictions](./06_Visualising_predictions.ipynb) <a> <a target=\"_blank\" href=\"https://colab.research.google.com/github/moj-analytical-services/splink/blob/master/docs/demos/tutorials/06_Visualising_predictions.ipynb\">\n",
        "  <img src=\"https://colab.research.google.com/assets/colab-badge.svg\" alt=\"Open In Colab\"/>\n",
        "</a>\n",
        "\n",
        "- [7. Evaluation](./07_Evaluation.ipynb) <a> <a target=\"_blank\" href=\"https://colab.research.google.com/github/moj-analytical-services/splink/blob/master/docs/demos/tutorials/07_Evaluation.ipynb\">\n",
        "  <img src=\"https://colab.research.google.com/assets/colab-badge.svg\" alt=\"Open In Colab\"/>\n",
        "</a>\n",
        "\n",
        "- [8. Building your own model](./08_building_your_own_model.md) \n",
        "\n",
        "\n",
        "Throughout the tutorial, we use the duckdb backend, which is the recommended option for smaller datasets of up to around 1 million records on a normal laptop.\n",
        "\n",
        "You can find these tutorial notebooks in the `docs/demos/tutorials/` folder of the  [splink repo](https://github.com/moj-analytical-services/splink/tree/master/docs/demos/tutorials), or click the Colab links to run in your browser."
      ]
    },
    {
      "attachments": {},
      "cell_type": "markdown",
      "id": "33c575ca",
      "metadata": {},
      "source": [
        "## End-to-end demos\n",
        "\n",
        "After following the steps of the tutorial, it might prove useful to have a look at some of the [example notebooks](https://moj-analytical-services.github.io/splink/demos/examples/examples_index.html) that show various use-case scenarios of Splink from start to finish.\n",
        "\n",
        "## Interactive Introduction to Record Linkage Theory\n",
        "\n",
        "If you'd like to learn more about record linkage theory, an interactive introduction is available [here](https://www.robinlinacre.com/intro_to_probabilistic_linkage/)."
      ]
    },
    {
      "cell_type": "markdown",
      "id": "8c28bba7",
      "metadata": {
        "vscode": {
          "languageId": "plaintext"
        }
      },
      "source": [
        "## LLM prompts\n",
        "\n",
        "If you're using an LLM to suggest Splink code, see [here](../../topic_guides/llms/prompting_llms.md) for suggested prompts and context."
      ]
    }
  ],
  "metadata": {
    "kernelspec": {
      "display_name": "Python 3 (ipykernel)",
      "language": "python",
      "name": "python3"
    },
    "language_info": {
      "codemirror_mode": {
        "name": "ipython",
        "version": 3
      },
      "file_extension": ".py",
      "mimetype": "text/x-python",
      "name": "python",
      "nbconvert_exporter": "python",
      "pygments_lexer": "ipython3",
      "version": "3.10.8"
    },
    "vscode": {
      "interpreter": {
        "hash": "3b53fa520a31e303a9636a08ff10a3bbc14893ee50cb37445791fa59628fc75b"
      }
    }
  },
  "nbformat": 4,
  "nbformat_minor": 5
}
